14 research outputs found
Recommended from our members
Exploiting patterns in genomic data for personalised cancer treatment and new target discovery
In response to a global requirement for improved cancer treatments a number of promising novel targeted cancer therapies are being developed that exploit vulnerabilities in cancer cells that are not present in healthy cells. In this thesis I explore different ways of identifying the vulnerabilities of cancer cells, with the ultimate aim of providing personalised therapies to cancer patients on an individual basis.
I first investigate approaches that utilise the concept of synthetic lethality. Therapies that exploit synthetic lethality are suitable where a specific tumour suppressor has been inactivated by a cancer and an identified synthetic lethal (SSL) pair for that gene may be therapeutically targeted.
Mainly due to the constraints of the experimental procedures, relatively few human SSL interactions have been identified. Here I describe computational systems approaches for predicting human SSL interactions by identifying and exploiting conserved patterns in protein-protein interaction (PPI) network topology both within and across model species. I report that my classifiers out-perform previous attempts to classify human SSL interactions. Experimental validation of my predictions suggest they may provide useful guidance for future SSL screenings and ultimately aid targeted cancer therapy development.
All predictions from this study have been made available via a new online database that I designed, built and published.
As an extension to this approach I used similar network features to predict gene dependencies, otherwise known as acquired essential genes, in specific cancer cell lines. Genetic alterations found in each individual cell line were modelled using the novel approach of removing protein nodes to reflect loss of function mutations and changing the weights of edges in each protein-protein interaction network to reflect gain of function mutations and gene expression changes.
I report that base PPI networks can be used to successfully classify human cell line specific gene dependencies within individual cell lines, between cell lines and even across tissue types. Furthermore, my personalised PPI network models further improve prediction power and show improved sensitivity to rarer gene dependencies, an improvement which offers opportunities for personalised therapy. In a therapeutic context these essential genes would be suitable as individual drug targets for each specific patient.
Finally, I analyse copy number variance and ploidy in a set of cancers from kidney patients. Using clustering algorithms I investigate patterns in cancer cell line arm-wise ploidy and identify factors that may be driving this genomic instability
Recommended from our members
Defining signatures of arm-wise copy number change and their associated drivers in kidney cancers
Using pan-cancer data from The Cancer Genome Atlas (TCGA), we investigated how patterns in copy number alterations in cancer cells vary both by tissue type and as a function of genetic alteration. We find that patterns in both chromosomal ploidy and individual arm copy number are dependent on tumour type. We highlight for example, the significant losses in chromosome arm 3p and the gain of ploidy in 5q in kidney clear cell renal cell carcinoma tissue samples. We find that specific gene mutations are associated with genome-wide copy number changes. Using
signatures derived from non-negative matrix factorisation (NMF), we also find gene mutations that are associated with particular patterns of ploidy change. Finally, utilising a set of machine learning classifiers, we successfully predicted the presence of mutated genes in a sample using arm-wise copy number patterns as features. This demonstrates that mutations in specific genes are correlated and may lead to specific patterns of ploidy loss and gain across chromosome arms. Using these same classifiers, we highlight which arms are most predictive of commonly mutated genes in kidney renal clear cell carcinoma (KIRC)
Computational approaches to identify genetic interactions for cancer therapeutics
The development of improved cancer therapies is frequently cited as an urgent unmet medical need. Here we describe how genetic interactions are being therapeutically exploited to identify novel targeted treatments for cancer. We discuss the current methodologies that use ‘omics data to identify genetic interactions, in particular focusing on synthetic sickness lethality (SSL) and synthetic dosage lethality (SDL). We describe the experimen- tal and computational approaches undertaken both in humans and model organisms to identify these interac- tions. Finally we discuss some of the identified targets with licensed drugs, inhibitors in clinical trials or with compounds under development
Recommended from our members
'Big data' approaches for novel anti-cancer drug discovery
Introduction: The development of improved cancer therapies is frequently cited as an urgent unmet medical need. Here we review how recent advances in platform technologies and the increasing availability of biological ‘big data’ are providing an unparalleled opportunity to systematically identify the key genes and pathways involved in tumorigenesis. We then discuss how these discoveries may be amenable to therapeutic interventions.
Areas covered: We discuss the current approaches that use ‘big data’ to identify cancer drivers. These approaches include genomic sequencing, pathway data, multi-platform data, identifying genetic interactions such as synthetic lethality and using cell line data. We review how big data is being used to assess the tractability of potential drug targets and how systems biology is being utilised to identify novel drug targets. We finish the review with an overview of available data repositories and tools being used at the forefront of cancer drug discovery.
Expert opinion: Targeted therapies based on the genomic events driving the tumour will eventually inform treatment protocols. However, using a tailored approach to treat all tumour patients may require developing a large repertoire of targeted drugs
Recommended from our members
Identification and analysis of mutational hotspots in oncogenes and tumour suppressors
Background: The key to interpreting the contribution of a disease-associated mutation in the development and progression of cancer is an understanding of the consequences of that mutation both on the function of the affected protein and on the pathways in which that protein is involved. Protein domains encapsulate function and position-specific domain based analysis of mutations have been shown to help elucidate their phenotypes.
Results: In this paper we examine the domain biases in oncogenes and tumour suppressors, and find that their domain compositions substantially differ. Using data from over 30 different cancers from whole-exome sequencing cancer genomic projects we mapped over one million mutations to their respective Pfam domains to identify which domains are enriched in any of three different classes of mutation; missense, indels or truncations. Next, we identified the mutational hotspots within domain families by mapping small mutations to equivalent positions in multiple sequence alignments of protein domains
We find that gain of function mutations from oncogenes and loss of function mutations from tumour suppressors are normally found in different domain families and when observed in the same domain families, hotspot mutations are located at different positions within the multiple sequence alignment of the domain.
Conclusions: By considering hotspots in tumour suppressors and oncogenes independently, we find that there are different specific positions within domain families that are particularly suited to accommodate either a loss or a gain of function mutation. The position is also dependent on the class of mutation.
We find rare mutations co-located with well-known functional mutation hotspots, in members of homologous domain superfamilies, and we detect novel mutation hotspots in domain families previously unconnected with cancer. The results of this analysis can be accessed through the MOKCa database (http://strubiol.icr.ac.uk/ extra/MOKCa)
Mutational patterns in oncogenes and tumour suppressors
All cancers depend upon mutations in critical genes, which confer a selective advantage to the tumour cell. Knowledge of these mutations is crucial to understanding the biology of cancer initiation and progression, and to the development of targeted therapeutic strategies. The key to understanding the contribution of a disease-associated mutation to the development and progression of cancer, comes from an understanding of the consequences of that mutation on the function of the affected protein, and the impact on the pathways in which that protein is involved. In this paper we examine the mutation patterns observed in oncogenes and tumour suppressors, and discuss different approaches that have been developed to identify driver mutations within cancers that contribute to the disease progress. We also discuss the MOKCa database where we have developed an automatic pipeline that structurally and functionally annotates all proteins from the human proteome that are mutated in cancer
Analysis of Archived Residual Newborn Screening Blood Spots After Whole Genome Amplification
Deidentified newborn screening bloodspot samples (NBS) represent a valuable potential resource for genomic research if impediments to whole exome sequencing of NBS deoxyribonucleic acid (DNA), including the small amount of genomic DNA in NBS material, can be overcome. For instance, genomic analysis of NBS could be used to define allele frequencies of disease-associated variants in local populations, or to conduct prospective or retrospective studies relating genomic variation to disease emergence in pediatric populations over time. In this study, we compared the recovery of variant calls from exome sequences of amplified NBS genomic DNA to variant calls from exome sequencing of non-amplified NBS DNA from the same individuals. Results: Using a standard alignment-based Genome Analysis Toolkit (GATK), we find 62,000-76,000 additional variants in amplified samples. After application of a unique kmer enumeration and variant detection method (RUFUS), only 38,000-47,000 additional variants are observed in amplified gDNA. This result suggests that roughly half of the amplification-introduced variants identified using GATK may be the result of mapping errors and read misalignment. Conclusions: Our results show that it is possible to obtain informative, high-quality data from exome analysis of whole genome amplified NBS with the important caveat that different data generation and analysis methods can affect variant detection accuracy, and the concordance of variant calls in whole-genome amplified and non-amplified exomes.National Institute of Health P01HD067244, NS076465, R01ES021006Nutritional Science
Predicting synthetic lethal interactions using conserved patterns in protein interaction networks
In response to a need for improved treatments, a number of promising novel targeted cancer therapies are being developed that exploit human synthetic lethal interactions. This is facilitating personalised medicine strategies in cancers where specific tumour suppressors have become inactivated. Mainly due to the constraints of the experimental procedures, relatively few human synthetic lethal interactions have been identified. Here we describe SLant (Synthetic Lethal analysis via Network topology), a computational systems approach to predicting human synthetic lethal interactions that works by identifying and exploiting conserved patterns in protein interaction network topology both within and across species. SLant out-performs previous attempts to classify human SSL interactions and experimental validation of the models predictions suggests it may provide useful guidance for future SSL screenings and ultimately aid targeted cancer therapy development
Repression of transcription at DNA breaks requires cohesin throughout interphase and prevents genome instability
Cohesin subunits are frequently mutated in cancer, but how they function as tumor suppressors is unknown. Cohesin mediates sister chromatid cohesion, but this is not always perturbed in cancer cells. Here, we identify a previously unknown role for cohesin. We find that cohesin is required to repress transcription at DNA double-strand breaks (DSBs). Notably, cohesin represses transcription at DSBs throughout interphase, indicating that this is distinct from its known role in mediating DNA repair through sister chromatid cohesion. We identified a cancer-associated SA2 mutation that supports sister chromatid cohesion but is unable to repress transcription at DSBs. We further show that failure to repress transcription at DSBs leads to large-scale genome rearrangements. Cancer samples lacking SA2 display mutational patterns consistent with loss of this pathway. These findings uncover a new function for cohesin that provides insights into its frequent loss in cancer
Bioinformatics in translational drug discovery
Bioinformatics approaches are becoming ever more essential in translational drug discovery both in academia and within the pharmaceutical industry. Computational exploitation of the increasing volumes of data generated during all phases of drug discovery is enabling key challenges of the process to be addressed. Here, we highlight some of the areas in which bioinformatics resources and methods are being developed to support the drug discovery pipeline. These include the creation of large data warehouses, bioinformatics algorithms to analyse ‘big data’ that identify novel drug targets and/or biomarkers, programs to assess the tractability of targets, and prediction of repositioning opportunities that use licensed drugs to treat additional indications